Dataset statistics
| Number of variables | 16 |
|---|---|
| Number of observations | 254974 |
| Missing cells | 3345 |
| Missing cells (%) | 0.1% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 31.4 MiB |
| Average record size in memory | 129.2 B |
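The percentage and per-record figures in this table follow from the raw counts. A minimal sketch of that arithmetic, using only numbers copied from the table (the variable names are illustrative):

```python
# Figures copied from the "Dataset statistics" table.
n_variables = 16
n_observations = 254_974
missing_cells = 3_345
total_size_mib = 31.4  # already rounded to one decimal in the report

# Every variable contributes one cell per observation.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# 1 MiB = 1024 * 1024 bytes. The result can differ slightly from the
# reported 129.2 B because the total size used here is a rounded value.
avg_record_bytes = total_size_mib * 1024 * 1024 / n_observations

print(f"total cells: {total_cells}")
print(f"missing cells: {missing_pct:.1f}%")
print(f"average record size: {avg_record_bytes:.1f} B")
```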
Variable types
| Text | 2 |
|---|---|
| Categorical | 3 |
| Numeric | 11 |
Alerts
| Alert | Type |
|---|---|
| countries_fr has a high cardinality: 536 distinct values | High cardinality |
| brands has a high cardinality: 45792 distinct values | High cardinality |
| countries_fr is highly imbalanced (82.1%) | Imbalance |
| brands has 3280 (1.3%) missing values | Missing |
| cholesterol_100g is highly skewed (γ1 = 293.6387501) | Skewed |
| code has unique values | Unique |
| additives_n has 82104 (32.2%) zeros | Zeros |
| energy_100g has 6091 (2.4%) zeros | Zeros |
| salt_100g has 36339 (14.3%) zeros | Zeros |
| sodium_100g has 36343 (14.3%) zeros | Zeros |
| fiber_100g has 124820 (49.0%) zeros | Zeros |
| sugars_100g has 50158 (19.7%) zeros | Zeros |
| fat_100g has 78545 (30.8%) zeros | Zeros |
| saturated_fat_100g has 96736 (37.9%) zeros | Zeros |
| cholesterol_100g has 200361 (78.6%) zeros | Zeros |
| nutrition_score_uk_100g has 12521 (4.9%) zeros | Zeros |
| nutrition_score_fr_100g has 11704 (4.6%) zeros | Zeros |
Reproduction
| Analysis started | 2024-06-07 15:45:32.095533 |
|---|---|
| Analysis finished | 2024-06-07 15:45:57.994357 |
| Duration | 25.9 seconds |
| Software version | ydata-profiling v0.0.dev0 |
| Download configuration | config.json |
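The reported duration can be checked against the two timestamps. A small sketch with the standard library, assuming both timestamps are in the same time zone:

```python
from datetime import datetime

# Timestamp layout used in the "Reproduction" table.
FMT = "%Y-%m-%d %H:%M:%S.%f"

started = datetime.strptime("2024-06-07 15:45:32.095533", FMT)
finished = datetime.strptime("2024-06-07 15:45:57.994357", FMT)

# Elapsed wall-clock time between start and finish, in seconds.
duration_seconds = (finished - started).total_seconds()
print(f"{duration_seconds:.1f} seconds")
```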
code
Text
UNIQUE 
| Distinct | 254974 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.9 MiB |
Length
| Max length | 30 |
|---|---|
| Median length | 13 |
| Mean length | 12.825159 |
| Min length | 2 |
Characters and Unicode
| Total characters | 3270082 |
|---|---|
| Distinct characters | 10 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
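The length and character figures for `code` are mutually consistent: the mean length is the total character count divided by the number of values, and the ten digit counts listed further down add up to that same total. A sketch of the check, with all numbers copied from this profile:

```python
# Figures copied from the profile of `code`.
total_characters = 3_270_082
n_values = 254_974  # no missing values, so every row contributes a string

# Per-digit counts from the "Most occurring characters" table.
digit_counts = {
    "0": 836_904, "1": 361_910, "2": 306_344, "3": 303_811, "7": 265_991,
    "4": 260_417, "5": 250_756, "8": 245_215, "6": 237_109, "9": 201_625,
}

mean_length = total_characters / n_values
digits_total = sum(digit_counts.values())

# Share of each digit among all characters, rounded as in the report.
digit_share = {d: round(100 * c / total_characters, 1) for d, c in digit_counts.items()}

print(f"mean length: {mean_length:.6f}")
print(f"digit counts sum to total: {digits_total == total_characters}")
print(f"share of '0': {digit_share['0']}%")
```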
Unique
| Unique | 254974 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 0000000004530 |
|---|---|
| 2nd row | 0000000004559 |
| 3rd row | 0000000016087 |
| 4th row | 0000000016094 |
| 5th row | 0000000016100 |
| Value | Count | Frequency (%) |
| 0000000004530 | 1 | < 0.1% |
| 0000000017497 | 1 | < 0.1% |
| 0000000032070 | 1 | < 0.1% |
| 0000000018449 | 1 | < 0.1% |
| 0000000016087 | 1 | < 0.1% |
| 0000000016094 | 1 | < 0.1% |
| 0000000016100 | 1 | < 0.1% |
| 0000000016117 | 1 | < 0.1% |
| 0000000016124 | 1 | < 0.1% |
| 0000000016193 | 1 | < 0.1% |
| Other values (254964) | 254964 | > 99.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 836904 | 25.6% |
| 1 | 361910 | 11.1% |
| 2 | 306344 | 9.4% |
| 3 | 303811 | 9.3% |
| 7 | 265991 | 8.1% |
| 4 | 260417 | 8.0% |
| 5 | 250756 | 7.7% |
| 8 | 245215 | 7.5% |
| 6 | 237109 | 7.3% |
| 9 | 201625 | 6.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3270082 | 100.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 836904 | 25.6% |
| 1 | 361910 | 11.1% |
| 2 | 306344 | 9.4% |
| 3 | 303811 | 9.3% |
| 7 | 265991 | 8.1% |
| 4 | 260417 | 8.0% |
| 5 | 250756 | 7.7% |
| 8 | 245215 | 7.5% |
| 6 | 237109 | 7.3% |
| 9 | 201625 | 6.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3270082 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 836904 | 25.6% |
| 1 | 361910 | 11.1% |
| 2 | 306344 | 9.4% |
| 3 | 303811 | 9.3% |
| 7 | 265991 | 8.1% |
| 4 | 260417 | 8.0% |
| 5 | 250756 | 7.7% |
| 8 | 245215 | 7.5% |
| 6 | 237109 | 7.3% |
| 9 | 201625 | 6.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3270082 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 836904 | 25.6% |
| 1 | 361910 | 11.1% |
| 2 | 306344 | 9.4% |
| 3 | 303811 | 9.3% |
| 7 | 265991 | 8.1% |
| 4 | 260417 | 8.0% |
| 5 | 250756 | 7.7% |
| 8 | 245215 | 7.5% |
| 6 | 237109 | 7.3% |
| 9 | 201625 | 6.2% |
countries_fr
Categorical
HIGH CARDINALITY  IMBALANCE 
| Distinct | 536 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 65 |
| Missing (%) | < 0.1% |
| Memory size | 2.5 MiB |
| États-Unis | 167716 |
|---|---|
| France | 61464 |
| Suisse | 8416 |
| Allemagne | 4464 |
| Espagne | 2924 |
| Other values (531) | 9925 |
Length
| Max length | 211 |
|---|---|
| Median length | 10 |
| Mean length | 8.9393627 |
| Min length | 4 |
Characters and Unicode
| Total characters | 2278724 |
|---|---|
| Distinct characters | 120 |
| Distinct categories | 9 |
| Distinct scripts | 7 |
| Distinct blocks | 8 |
Unique
| Unique | 298 |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | États-Unis |
|---|---|
| 2nd row | États-Unis |
| 3rd row | États-Unis |
| 4th row | États-Unis |
| 5th row | États-Unis |
Common Values
| Value | Count | Frequency (%) |
| États-Unis | 167716 | 65.8% |
| France | 61464 | 24.1% |
| Suisse | 8416 | 3.3% |
| Allemagne | 4464 | 1.8% |
| Espagne | 2924 | 1.1% |
| Royaume-Uni | 1620 | 0.6% |
| France,Suisse | 1066 | 0.4% |
| Russie | 805 | 0.3% |
| Belgique | 598 | 0.2% |
| Australie | 502 | 0.2% |
| Other values (526) | 5334 | 2.1% |
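The report flags countries_fr as highly imbalanced (82.1%) without showing how that score is derived. One common definition of class imbalance is one minus the normalized Shannon entropy of the category counts; the sketch below assumes that definition. The function name `imbalance_score` is hypothetical, and whether the profiling tool uses exactly this formula is an assumption. The reported 82.1% cannot be reproduced from the table above, because 526 categories are folded into "Other values".

```python
import math


def imbalance_score(counts):
    """Return 1 - H / H_max for a list of category counts.

    H is the Shannon entropy (base 2) of the observed category shares and
    H_max = log2(number of categories) is the entropy of a uniform split.
    0.0 means perfectly balanced; values near 1.0 mean one category dominates.
    """
    counts = [c for c in counts if c > 0]
    n_classes = len(counts)
    if n_classes <= 1:
        # Entropy normalization is undefined for a single category;
        # returning 0.0 here is a convention chosen for this sketch.
        return 0.0
    total = sum(counts)
    entropy = -sum((c / total) * math.log2(c / total) for c in counts)
    return 1.0 - entropy / math.log2(n_classes)


balanced = imbalance_score([10, 10, 10, 10])
skewed = imbalance_score([97, 1, 1, 1])
print(f"balanced: {balanced:.3f}, skewed: {skewed:.3f}")
```

Applied to the full list of 536 category counts, a function like this would give a value comparable to the alert, under the stated assumption about the formula.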
Words
| Value | Count | Frequency (%) |
| états-unis | 167717 | 65.7% |
| france | 61464 | 24.1% |
| suisse | 8416 | 3.3% |
| allemagne | 4464 | 1.7% |
| espagne | 2924 | 1.1% |
| royaume-uni | 1620 | 0.6% |
| france,suisse | 1066 | 0.4% |
| russie | 805 | 0.3% |
| belgique | 598 | 0.2% |
| australie | 502 | 0.2% |
| Other values (557) | 5588 | 2.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| s | 362874 | 15.9% |
| t | 338116 | 14.8% |
| a | 247633 | 10.9% |
| n | 245151 | 10.8% |
| i | 184954 | 8.1% |
| - | 170765 | 7.5% |
| U | 170232 | 7.5% |
| É | 168095 | 7.4% |
| e | 97677 | 4.3% |
| r | 66956 | 2.9% |
| Other values (110) | 226271 | 9.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1671706 | 73.4% |
| Uppercase Letter | 430706 | 18.9% |
| Dash Punctuation | 170765 | 7.5% |
| Other Punctuation | 5118 | 0.2% |
| Space Separator | 255 | < 0.1% |
| Other Letter | 168 | < 0.1% |
| Decimal Number | 3 | < 0.1% |
| Nonspacing Mark | 2 | < 0.1% |
| Spacing Mark | 1 | < 0.1% |
Most frequent character per category
Other Letter
| Value | Count | Frequency (%) |
| ا | 18 | 10.7% |
| ل | 16 | 9.5% |
| ة | 13 | 7.7% |
| ع | 11 | 6.5% |
| س | 10 | 6.0% |
| د | 8 | 4.8% |
| م | 8 | 4.8% |
| ي | 7 | 4.2% |
| ن | 7 | 4.2% |
| و | 7 | 4.2% |
| Other values (33) | 63 |
Lowercase Letter
| Value | Count | Frequency (%) |
| s | 362874 | |
| t | 338116 | |
| a | 247633 | |
| n | 245151 | |
| i | 184954 | |
| e | 97677 | 5.8% |
| r | 66956 | 4.0% |
| c | 64938 | 3.9% |
| u | 16859 | 1.0% |
| l | 14114 | 0.8% |
| Other values (30) | 32434 | 1.9% |
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 170232 | |
| É | 168095 | |
| F | 64627 | 15.0% |
| S | 10455 | 2.4% |
| A | 5948 | 1.4% |
| E | 3247 | 0.8% |
| R | 3166 | 0.7% |
| B | 1742 | 0.4% |
| P | 934 | 0.2% |
| I | 569 | 0.1% |
| Other values (17) | 1691 | 0.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 4985 | |
| : | 131 | 2.6% |
| ' | 2 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 7 | 2 | |
| 6 | 1 |
Nonspacing Mark
| Value | Count | Frequency (%) |
| ั | 1 | |
| ี | 1 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 170765 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 255 | 100.0% |
Spacing Mark
| Value | Count | Frequency (%) |
| ा | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2102403 | 92.3% |
| Common | 176141 | 7.7% |
| Arabic | 121 | < 0.1% |
| Thai | 38 | < 0.1% |
| Cyrillic | 9 | < 0.1% |
| Han | 8 | < 0.1% |
| Devanagari | 4 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| s | 362874 | |
| t | 338116 | |
| a | 247633 | |
| n | 245151 | |
| i | 184954 | |
| U | 170232 | |
| É | 168095 | |
| e | 97677 | 4.6% |
| r | 66956 | 3.2% |
| c | 64938 | 3.1% |
| Other values (50) | 155777 |
Thai
| Value | Count | Frequency (%) |
| ร | 5 | |
| เ | 4 | 10.5% |
| อ | 3 | 7.9% |
| า | 3 | 7.9% |
| ท | 3 | 7.9% |
| ย | 2 | 5.3% |
| ส | 2 | 5.3% |
| ศ | 2 | 5.3% |
| ะ | 2 | 5.3% |
| ป | 2 | 5.3% |
| Other values (10) | 10 |
Arabic
| Value | Count | Frequency (%) |
| ا | 18 | |
| ل | 16 | |
| ة | 13 | |
| ع | 11 | |
| س | 10 | |
| د | 8 | |
| م | 8 | |
| ي | 7 | 5.8% |
| ن | 7 | 5.8% |
| و | 7 | 5.8% |
| Other values (8) | 16 |
Common
| Value | Count | Frequency (%) |
| - | 170765 | |
| , | 4985 | 2.8% |
| (space) | 255 | 0.1% |
| : | 131 | 0.1% |
| 7 | 2 | < 0.1% |
| ' | 2 | < 0.1% |
| 6 | 1 | < 0.1% |
Cyrillic
| Value | Count | Frequency (%) |
| а | 3 | |
| т | 1 | 11.1% |
| н | 1 | 11.1% |
| х | 1 | 11.1% |
| с | 1 | 11.1% |
| з | 1 | 11.1% |
| К | 1 | 11.1% |
Han
| Value | Count | Frequency (%) |
| 日 | 2 | |
| 本 | 2 | |
| 港 | 2 | |
| 香 | 2 |
Devanagari
| Value | Count | Frequency (%) |
| भ | 1 | |
| ा | 1 | |
| र | 1 | |
| त | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2109731 | 92.6% |
| None | 168804 | 7.4% |
| Arabic | 121 | < 0.1% |
| Thai | 38 | < 0.1% |
| IPA Ext | 9 | < 0.1% |
| Cyrillic | 9 | < 0.1% |
| CJK | 8 | < 0.1% |
| Devanagari | 4 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| s | 362874 | |
| t | 338116 | |
| a | 247633 | |
| n | 245151 | |
| i | 184954 | |
| - | 170765 | |
| U | 170232 | |
| e | 97677 | 4.6% |
| r | 66956 | 3.2% |
| c | 64938 | 3.1% |
| Other values (47) | 160435 |
None
| Value | Count | Frequency (%) |
| É | 168095 | |
| é | 464 | 0.3% |
| è | 178 | 0.1% |
| ï | 38 | < 0.1% |
| ç | 17 | < 0.1% |
| ë | 8 | < 0.1% |
| ô | 2 | < 0.1% |
| ê | 1 | < 0.1% |
| Î | 1 | < 0.1% |
Arabic
| Value | Count | Frequency (%) |
| ا | 18 | |
| ل | 16 | |
| ة | 13 | |
| ع | 11 | |
| س | 10 | |
| د | 8 | |
| م | 8 | |
| ي | 7 | 5.8% |
| ن | 7 | 5.8% |
| و | 7 | 5.8% |
| Other values (8) | 16 |
IPA Ext
| Value | Count | Frequency (%) |
| ə | 9 |
Thai
| Value | Count | Frequency (%) |
| ร | 5 | |
| เ | 4 | 10.5% |
| อ | 3 | 7.9% |
| า | 3 | 7.9% |
| ท | 3 | 7.9% |
| ย | 2 | 5.3% |
| ส | 2 | 5.3% |
| ศ | 2 | 5.3% |
| ะ | 2 | 5.3% |
| ป | 2 | 5.3% |
| Other values (10) | 10 |
Cyrillic
| Value | Count | Frequency (%) |
| а | 3 | |
| т | 1 | 11.1% |
| н | 1 | 11.1% |
| х | 1 | 11.1% |
| с | 1 | 11.1% |
| з | 1 | 11.1% |
| К | 1 | 11.1% |
CJK
| Value | Count | Frequency (%) |
| 日 | 2 | |
| 本 | 2 | |
| 港 | 2 | |
| 香 | 2 |
Devanagari
| Value | Count | Frequency (%) |
| भ | 1 | |
| ा | 1 | |
| र | 1 | |
| त | 1 |
product_name
Text
| Distinct | 184576 |
|---|---|
| Distinct (%) | 72.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.9 MiB |
Length
| Max length | 234 |
|---|---|
| Median length | 161 |
| Mean length | 26.842211 |
| Min length | 1 |
Characters and Unicode
| Total characters | 6844066 |
|---|---|
| Distinct characters | 607 |
| Distinct categories | 20 |
| Distinct scripts | 12 |
| Distinct blocks | 20 |
Unique
| Unique | 163912 |
|---|---|
| Unique (%) | 64.3% |
Sample
| 1st row | Banana Chips Sweetened (Whole) |
|---|---|
| 2nd row | Peanuts |
| 3rd row | Organic Salted Nut Mix |
| 4th row | Organic Polenta |
| 5th row | Breadshop Honey Gone Nuts Granola |
| Value | Count | Frequency (%) |
| 24307 | 2.3% | |
| de | 20079 | 1.9% |
| chocolate | 11169 | 1.1% |
| cheese | 10274 | 1.0% |
| sauce | 9975 | 1.0% |
| organic | 9105 | 0.9% |
| with | 8089 | 0.8% |
| mix | 7066 | 0.7% |
| au | 6628 | 0.6% |
| cream | 5938 | 0.6% |
| Other values (43718) | 936424 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 797591 | 11.7% |
| e | 679702 | 9.9% |
| a | 519009 | 7.6% |
| r | 398308 | 5.8% |
| i | 390348 | 5.7% |
| o | 354039 | 5.2% |
| t | 319760 | 4.7% |
| n | 306049 | 4.5% |
| s | 291079 | 4.3% |
| l | 280256 | 4.1% |
| Other values (597) | 2507925 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4948730 | 72.3% |
| Uppercase Letter | 904442 | 13.2% |
| Space Separator | 797635 | 11.7% |
| Other Punctuation | 132464 | 1.9% |
| Decimal Number | 35804 | 0.5% |
| Dash Punctuation | 13463 | 0.2% |
| Close Punctuation | 4678 | 0.1% |
| Open Punctuation | 4676 | 0.1% |
| Math Symbol | 1068 | < 0.1% |
| Other Letter | 467 | < 0.1% |
| Other values (10) | 639 | < 0.1% |
Most frequent character per category
Other Letter
| Value | Count | Frequency (%) |
| ל | 11 | 2.4% |
| ו | 9 | 1.9% |
| º | 8 | 1.7% |
| י | 8 | 1.7% |
| ร | 8 | 1.7% |
| อ | 7 | 1.5% |
| ק | 7 | 1.5% |
| เ | 6 | 1.3% |
| ل | 6 | 1.3% |
| น | 6 | 1.3% |
| Other values (257) | 391 |
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 679702 | |
| a | 519009 | |
| r | 398308 | 8.0% |
| i | 390348 | 7.9% |
| o | 354039 | 7.2% |
| t | 319760 | 6.5% |
| n | 306049 | 6.2% |
| s | 291079 | 5.9% |
| l | 280256 | 5.7% |
| u | 212856 | 4.3% |
| Other values (136) | 1197324 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 142429 | |
| S | 110619 | |
| P | 78402 | 8.7% |
| B | 73567 | 8.1% |
| M | 57888 | 6.4% |
| F | 44496 | 4.9% |
| G | 37689 | 4.2% |
| T | 37267 | 4.1% |
| O | 34702 | 3.8% |
| R | 33710 | 3.7% |
| Other values (89) | 253673 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 83639 | |
| & | 19841 | 15.0% |
| ' | 13559 | 10.2% |
| % | 7182 | 5.4% |
| . | 3243 | 2.4% |
| ; | 1856 | 1.4% |
| ! | 1197 | 0.9% |
| / | 724 | 0.5% |
| : | 649 | 0.5% |
| " | 170 | 0.1% |
| Other values (11) | 404 | 0.3% |
Nonspacing Mark
| Value | Count | Frequency (%) |
| ิ | 7 | |
| ้ | 6 | |
| ่ | 4 | |
| ́ | 4 | |
| ์ | 4 | |
| ี | 4 | |
| ︎ | 3 | |
| ️ | 1 | 2.7% |
| ̀ | 1 | 2.7% |
| ̈ | 1 | 2.7% |
| Other values (2) | 2 | 5.4% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 10916 | |
| 1 | 6689 | |
| 2 | 4939 | |
| 5 | 3045 | 8.5% |
| 4 | 2631 | 7.3% |
| 3 | 2631 | 7.3% |
| 6 | 1640 | 4.6% |
| 8 | 1504 | 4.2% |
| 7 | 1147 | 3.2% |
| 9 | 662 | 1.8% |
Other Symbol
| Value | Count | Frequency (%) |
| ® | 90 | |
| ° | 81 | |
| ♥ | 6 | 3.1% |
| № | 6 | 3.1% |
| ™ | 5 | 2.6% |
| © | 2 | 1.0% |
| ℅ | 2 | 1.0% |
| 🅫 | 1 | 0.5% |
| ❤ | 1 | 0.5% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 1045 | |
| | | 8 | 0.7% |
| = | 6 | 0.6% |
| ~ | 3 | 0.3% |
| > | 2 | 0.2% |
| < | 2 | 0.2% |
| ≤ | 1 | 0.1% |
| × | 1 | 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 4628 | |
| [ | 37 | 0.8% |
| { | 7 | 0.1% |
| „ | 3 | 0.1% |
| ‚ | 1 | < 0.1% |
Modifier Letter
| Value | Count | Frequency (%) |
| ー | 4 | |
| ー | 2 | |
| ゙ | 2 | |
| ゚ | 1 | 11.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 797591 | > 99.9% |
| (non-ASCII space, unidentified) | 43 | < 0.1% |
| (non-ASCII space, unidentified) | 1 | < 0.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 13451 | |
| – | 7 | 0.1% |
| — | 5 | < 0.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 4636 | |
| ] | 36 | 0.8% |
| } | 6 | 0.1% |
Initial Punctuation
| Value | Count | Frequency (%) |
| « | 136 | |
| “ | 6 | 4.2% |
| ‘ | 1 | 0.7% |
Final Punctuation
| Value | Count | Frequency (%) |
| » | 135 | |
| ’ | 10 | 6.8% |
| ” | 3 | 2.0% |
Currency Symbol
| Value | Count | Frequency (%) |
| $ | 45 | |
| € | 11 | 19.3% |
| ¢ | 1 | 1.8% |
Modifier Symbol
| Value | Count | Frequency (%) |
| ` | 16 | |
| ´ | 9 | |
| ¨ | 1 | 3.8% |
Control
| Value | Count | Frequency (%) |
| | 9 | |
| | 6 | |
| | 2 | 11.8% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 7 |
Other Number
| Value | Count | Frequency (%) |
| ² | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 5834074 | 85.2% |
| Common | 990389 | 14.5% |
| Cyrillic | 18855 | 0.3% |
| Greek | 251 | < 0.1% |
| Han | 144 | < 0.1% |
| Thai | 115 | < 0.1% |
| Hebrew | 83 | < 0.1% |
| Hiragana | 55 | < 0.1% |
| Katakana | 36 | < 0.1% |
| Hangul | 34 | < 0.1% |
| Other values (2) | 30 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 679702 | 11.7% |
| a | 519009 | 8.9% |
| r | 398308 | 6.8% |
| i | 390348 | 6.7% |
| o | 354039 | 6.1% |
| t | 319760 | 5.5% |
| n | 306049 | 5.2% |
| s | 291079 | 5.0% |
| l | 280256 | 4.8% |
| u | 212856 | 3.6% |
| Other values (132) | 2082668 |
Han
| Value | Count | Frequency (%) |
| 味 | 3 | 2.1% |
| 醤 | 3 | 2.1% |
| 油 | 3 | 2.1% |
| 奶 | 3 | 2.1% |
| 葡 | 2 | 1.4% |
| 萄 | 2 | 1.4% |
| 生 | 2 | 1.4% |
| 豆 | 2 | 1.4% |
| 紅 | 2 | 1.4% |
| 乾 | 2 | 1.4% |
| Other values (101) | 120 |
Common
| Value | Count | Frequency (%) |
| (space) | 797591 | 80.5% |
| , | 83639 | 8.4% |
| & | 19841 | 2.0% |
| ' | 13559 | 1.4% |
| - | 13451 | 1.4% |
| 0 | 10916 | 1.1% |
| % | 7182 | 0.7% |
| 1 | 6689 | 0.7% |
| 2 | 4939 | 0.5% |
| ) | 4636 | 0.5% |
| Other values (72) | 27946 | 2.8% |
Cyrillic
| Value | Count | Frequency (%) |
| о | 2058 | 10.9% |
| а | 1768 | 9.4% |
| е | 1344 | 7.1% |
| н | 1319 | 7.0% |
| и | 1131 | 6.0% |
| р | 1090 | 5.8% |
| с | 1037 | 5.5% |
| к | 971 | 5.1% |
| л | 912 | 4.8% |
| т | 679 | 3.6% |
| Other values (51) | 6546 |
Greek
| Value | Count | Frequency (%) |
| α | 23 | 9.2% |
| ο | 21 | 8.4% |
| ι | 18 | 7.2% |
| ρ | 15 | 6.0% |
| λ | 14 | 5.6% |
| κ | 10 | 4.0% |
| τ | 10 | 4.0% |
| ς | 8 | 3.2% |
| ά | 8 | 3.2% |
| μ | 8 | 3.2% |
| Other values (33) | 116 |
Thai
| Value | Count | Frequency (%) |
| ร | 8 | 7.0% |
| ิ | 7 | 6.1% |
| อ | 7 | 6.1% |
| เ | 6 | 5.2% |
| ้ | 6 | 5.2% |
| น | 6 | 5.2% |
| ล | 5 | 4.3% |
| ก | 5 | 4.3% |
| ว | 5 | 4.3% |
| ส | 5 | 4.3% |
| Other values (27) | 55 |
Hiragana
| Value | Count | Frequency (%) |
| の | 5 | 9.1% |
| ん | 5 | 9.1% |
| し | 3 | 5.5% |
| う | 3 | 5.5% |
| ど | 3 | 5.5% |
| ら | 3 | 5.5% |
| め | 2 | 3.6% |
| だ | 2 | 3.6% |
| そ | 2 | 3.6% |
| ば | 2 | 3.6% |
| Other values (22) | 25 |
Hangul
| Value | Count | Frequency (%) |
| 차 | 2 | 5.9% |
| 튼 | 2 | 5.9% |
| 자 | 2 | 5.9% |
| 장 | 1 | 2.9% |
| 무 | 1 | 2.9% |
| 쌀 | 1 | 2.9% |
| 떡 | 1 | 2.9% |
| 칠 | 1 | 2.9% |
| 성 | 1 | 2.9% |
| 사 | 1 | 2.9% |
| Other values (21) | 21 |
Katakana
| Value | Count | Frequency (%) |
| ン | 3 | 8.3% |
| グ | 2 | 5.6% |
| ミ | 2 | 5.6% |
| ロ | 2 | 5.6% |
| チ | 2 | 5.6% |
| ク | 1 | 2.8% |
| レ | 1 | 2.8% |
| ズ | 1 | 2.8% |
| レ | 1 | 2.8% |
| セ | 1 | 2.8% |
| Other values (20) | 20 |
Hebrew
| Value | Count | Frequency (%) |
| ל | 11 | |
| ו | 9 | 10.8% |
| י | 8 | 9.6% |
| ק | 7 | 8.4% |
| פ | 6 | 7.2% |
| א | 5 | 6.0% |
| ר | 4 | 4.8% |
| ה | 3 | 3.6% |
| ב | 3 | 3.6% |
| מ | 3 | 3.6% |
| Other values (13) | 24 |
Arabic
| Value | Count | Frequency (%) |
| ل | 6 | |
| ي | 3 | |
| م | 2 | 10.0% |
| س | 2 | 10.0% |
| ب | 2 | 10.0% |
| ح | 1 | 5.0% |
| ق | 1 | 5.0% |
| ا | 1 | 5.0% |
| د | 1 | 5.0% |
| خ | 1 | 5.0% |
Inherited
| Value | Count | Frequency (%) |
| ́ | 4 | |
| ︎ | 3 | |
| ️ | 1 | 10.0% |
| ̀ | 1 | 10.0% |
| ̈ | 1 | 10.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6772640 | 99.0% |
| None | 51984 | 0.8% |
| Cyrillic | 18855 | 0.3% |
| CJK | 144 | < 0.1% |
| Thai | 115 | < 0.1% |
| Hebrew | 83 | < 0.1% |
| Punctuation | 55 | < 0.1% |
| Hiragana | 55 | < 0.1% |
| Katakana | 35 | < 0.1% |
| Hangul | 34 | < 0.1% |
| Other values (10) | 66 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 797591 | 11.8% |
| e | 679702 | 10.0% |
| a | 519009 | 7.7% |
| r | 398308 | 5.9% |
| i | 390348 | 5.8% |
| o | 354039 | 5.2% |
| t | 319760 | 4.7% |
| n | 306049 | 4.5% |
| s | 291079 | 4.3% |
| l | 280256 | 4.1% |
| Other values (84) | 2436499 |
None
| Value | Count | Frequency (%) |
| é | 28969 | |
| à | 4999 | 9.6% |
| è | 4378 | 8.4% |
| â | 2386 | 4.6% |
| ê | 1434 | 2.8% |
| û | 1248 | 2.4% |
| ü | 964 | 1.9% |
| ä | 699 | 1.3% |
| ç | 680 | 1.3% |
| ô | 672 | 1.3% |
| Other values (145) | 5555 | 10.7% |
Cyrillic
| Value | Count | Frequency (%) |
| о | 2058 | 10.9% |
| а | 1768 | 9.4% |
| е | 1344 | 7.1% |
| н | 1319 | 7.0% |
| и | 1131 | 6.0% |
| р | 1090 | 5.8% |
| с | 1037 | 5.5% |
| к | 971 | 5.1% |
| л | 912 | 4.8% |
| т | 679 | 3.6% |
| Other values (51) | 6546 |
Currency Symbols
| Value | Count | Frequency (%) |
| € | 11 |
Hebrew
| Value | Count | Frequency (%) |
| ל | 11 | |
| ו | 9 | 10.8% |
| י | 8 | 9.6% |
| ק | 7 | 8.4% |
| פ | 6 | 7.2% |
| א | 5 | 6.0% |
| ר | 4 | 4.8% |
| ה | 3 | 3.6% |
| ב | 3 | 3.6% |
| מ | 3 | 3.6% |
| Other values (13) | 24 |
Punctuation
| Value | Count | Frequency (%) |
| ’ | 10 | |
| … | 9 | |
| • | 9 | |
| – | 7 | |
| “ | 6 | |
| — | 5 | |
| „ | 3 | 5.5% |
| ” | 3 | 5.5% |
| ‘ | 1 | 1.8% |
| ‚ | 1 | 1.8% |
Thai
| Value | Count | Frequency (%) |
| ร | 8 | 7.0% |
| ิ | 7 | 6.1% |
| อ | 7 | 6.1% |
| เ | 6 | 5.2% |
| ้ | 6 | 5.2% |
| น | 6 | 5.2% |
| ล | 5 | 4.3% |
| ก | 5 | 4.3% |
| ว | 5 | 4.3% |
| ส | 5 | 4.3% |
| Other values (27) | 55 |
Misc Symbols
| Value | Count | Frequency (%) |
| ♥ | 6 |
Letterlike Symbols
| Value | Count | Frequency (%) |
| № | 6 | |
| ™ | 5 | |
| ℅ | 2 | 15.4% |
Arabic
| Value | Count | Frequency (%) |
| ل | 6 | |
| ي | 3 | |
| م | 2 | 10.0% |
| س | 2 | 10.0% |
| ب | 2 | 10.0% |
| ح | 1 | 5.0% |
| ق | 1 | 5.0% |
| ا | 1 | 5.0% |
| د | 1 | 5.0% |
| خ | 1 | 5.0% |
Hiragana
| Value | Count | Frequency (%) |
| の | 5 | 9.1% |
| ん | 5 | 9.1% |
| し | 3 | 5.5% |
| う | 3 | 5.5% |
| ど | 3 | 5.5% |
| ら | 3 | 5.5% |
| め | 2 | 3.6% |
| だ | 2 | 3.6% |
| そ | 2 | 3.6% |
| ば | 2 | 3.6% |
| Other values (22) | 25 |
Diacriticals
| Value | Count | Frequency (%) |
| ́ | 4 | |
| ̀ | 1 | 16.7% |
| ̈ | 1 | 16.7% |
Katakana
| Value | Count | Frequency (%) |
| ー | 4 | 11.4% |
| ン | 3 | 8.6% |
| グ | 2 | 5.7% |
| ミ | 2 | 5.7% |
| ロ | 2 | 5.7% |
| チ | 2 | 5.7% |
| レ | 1 | 2.9% |
| ズ | 1 | 2.9% |
| リ | 1 | 2.9% |
| ド | 1 | 2.9% |
| Other values (16) | 16 |
CJK
| Value | Count | Frequency (%) |
| 味 | 3 | 2.1% |
| 醤 | 3 | 2.1% |
| 油 | 3 | 2.1% |
| 奶 | 3 | 2.1% |
| 葡 | 2 | 1.4% |
| 萄 | 2 | 1.4% |
| 生 | 2 | 1.4% |
| 豆 | 2 | 1.4% |
| 紅 | 2 | 1.4% |
| 乾 | 2 | 1.4% |
| Other values (101) | 120 |
VS
| Value | Count | Frequency (%) |
| ︎ | 3 | |
| ️ | 1 | 25.0% |
Hangul
| Value | Count | Frequency (%) |
| 차 | 2 | 5.9% |
| 튼 | 2 | 5.9% |
| 자 | 2 | 5.9% |
| 장 | 1 | 2.9% |
| 무 | 1 | 2.9% |
| 쌀 | 1 | 2.9% |
| 떡 | 1 | 2.9% |
| 칠 | 1 | 2.9% |
| 성 | 1 | 2.9% |
| 사 | 1 | 2.9% |
| Other values (21) | 21 |
Enclosed Alphanum Sup
| Value | Count | Frequency (%) |
| 🅫 | 1 |
Latin Ext Additional
| Value | Count | Frequency (%) |
| ề | 1 | |
| ớ | 1 | |
| ắ | 1 |
Math Operators
| Value | Count | Frequency (%) |
| ≤ | 1 |
Dingbats
| Value | Count | Frequency (%) |
| ❤ | 1 |
brands
Categorical
HIGH CARDINALITY  MISSING 
| Distinct | 45792 |
|---|---|
| Distinct (%) | 18.2% |
| Missing | 3280 |
| Missing (%) | 1.3% |
| Memory size | 5.4 MiB |
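The per-variable missing counts can be reconciled with the dataset-level total. The sketch below uses only figures from this report: the 65 missing values in countries_fr and the 3280 in brands together account for all 3345 missing cells, so the remaining variables cannot contain any missing values.

```python
n_observations = 254_974
dataset_missing_cells = 3_345  # from "Dataset statistics"

# Per-variable missing counts reported in the variable profiles.
missing_by_variable = {"countries_fr": 65, "brands": 3_280}

accounted_for = sum(missing_by_variable.values())
# Missing cells left over for all other variables combined.
remaining = dataset_missing_cells - accounted_for

brands_missing_pct = 100 * missing_by_variable["brands"] / n_observations

print(f"accounted for: {accounted_for}, remaining: {remaining}")
print(f"brands missing: {brands_missing_pct:.1f}%")
```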
| Carrefour | 2504 |
|---|---|
| Meijer | 1945 |
| Auchan | 1939 |
| U | 1754 |
| Kroger | 1632 |
| Other values (45787) | 241920 |
Length
| Max length | 228 |
|---|---|
| Median length | 155 |
| Mean length | 15.766129 |
| Min length | 1 |
Characters and Unicode
| Total characters | 3968240 |
|---|---|
| Distinct characters | 393 |
| Distinct categories | 16 |
| Distinct scripts | 11 |
| Distinct blocks | 13 |
Unique
| Unique | 25210 |
|---|---|
| Unique (%) | 10.0% |
Sample
| 1st row | Torn & Glasser |
|---|---|
| 2nd row | Grizzlies |
| 3rd row | Bob's Red Mill |
| 4th row | Unfi |
| 5th row | Lundberg |
Common Values
| Value | Count | Frequency (%) |
| Carrefour | 2504 | 1.0% |
| Meijer | 1945 | 0.8% |
| Auchan | 1939 | 0.8% |
| U | 1754 | 0.7% |
| Kroger | 1632 | 0.6% |
| Leader Price | 1412 | 0.6% |
| Spartan | 1323 | 0.5% |
| Ahold | 1320 | 0.5% |
| Casino | 1293 | 0.5% |
| Roundy's | 1252 | 0.5% |
| Other values (45782) | 235320 | 92.3% |
| (Missing) | 3280 | 1.3% |
Words
| Value | Count | Frequency (%) |
| inc | 34189 | 5.8% |
| foods | 14040 | 2.4% |
| llc | 8694 | 1.5% |
| company | 8525 | 1.4% |
| 7636 | 1.3% | |
| co | 7366 | 1.2% |
| food | 6890 | 1.2% |
| the | 4678 | 0.8% |
| market | 4023 | 0.7% |
| stores | 3895 | 0.7% |
| Other values (30953) | 494067 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 404219 | 10.2% |
| e | 330918 | 8.3% |
| a | 286274 | 7.2% |
| r | 259350 | 6.5% |
| o | 256665 | 6.5% |
| n | 220687 | 5.6% |
| i | 199614 | 5.0% |
| s | 175776 | 4.4% |
| t | 152876 | 3.9% |
| l | 146620 | 3.7% |
| Other values (383) | 1535241 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2774702 | 69.9% |
| Uppercase Letter | 615265 | 15.5% |
| Space Separator | 404219 | 10.2% |
| Other Punctuation | 155211 | 3.9% |
| Dash Punctuation | 9795 | 0.2% |
| Decimal Number | 6085 | 0.2% |
| Open Punctuation | 1118 | < 0.1% |
| Close Punctuation | 1117 | < 0.1% |
| Other Letter | 282 | < 0.1% |
| Math Symbol | 235 | < 0.1% |
| Other values (6) | 211 | < 0.1% |
Most frequent character per category
Other Letter
| Value | Count | Frequency (%) |
| า | 22 | 7.8% |
| ม | 21 | 7.4% |
| º | 11 | 3.9% |
| ต | 11 | 3.9% |
| แ | 11 | 3.9% |
| ร | 11 | 3.9% |
| ا | 7 | 2.5% |
| ك | 6 | 2.1% |
| น | 6 | 2.1% |
| ฟ | 5 | 1.8% |
| Other values (121) | 171 |
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 330918 | |
| a | 286274 | |
| r | 259350 | |
| o | 256665 | |
| n | 220687 | 8.0% |
| i | 199614 | 7.2% |
| s | 175776 | 6.3% |
| t | 152876 | 5.5% |
| l | 146620 | 5.3% |
| c | 120363 | 4.3% |
| Other values (105) | 625559 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 66028 | 10.7% |
| S | 53148 | 8.6% |
| F | 46802 | 7.6% |
| I | 44914 | 7.3% |
| M | 43380 | 7.1% |
| B | 39192 | 6.4% |
| L | 36374 | 5.9% |
| P | 30741 | 5.0% |
| A | 26881 | 4.4% |
| T | 26569 | 4.3% |
| Other values (81) | 201236 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 54488 | |
| , | 51423 | |
| ' | 27090 | |
| / | 10402 | 6.7% |
| & | 7961 | 5.1% |
| : | 2540 | 1.6% |
| ! | 1039 | 0.7% |
| " | 150 | 0.1% |
| · | 26 | < 0.1% |
| @ | 22 | < 0.1% |
| Other values (9) | 70 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 1212 | |
| 5 | 1130 | |
| 6 | 1018 | |
| 2 | 559 | |
| 1 | 540 | |
| 0 | 477 | 7.8% |
| 7 | 442 | 7.3% |
| 4 | 276 | 4.5% |
| 9 | 236 | 3.9% |
| 8 | 195 | 3.2% |
Nonspacing Mark
| Value | Count | Frequency (%) |
| ่ | 12 | |
| ้ | 6 | |
| ี | 4 | 13.8% |
| ิ | 3 | 10.3% |
| ์ | 2 | 6.9% |
| ู | 1 | 3.4% |
| ั | 1 | 3.4% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 232 | |
| | | 2 | 0.9% |
| ~ | 1 | 0.4% |
Other Symbol
| Value | Count | Frequency (%) |
| ® | 19 | |
| № | 4 | 16.7% |
| ° | 1 | 4.2% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 9789 | |
| — | 6 | 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1108 | |
| [ | 10 | 0.9% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1107 | |
| ] | 10 | 0.9% |
Currency Symbol
| Value | Count | Frequency (%) |
| $ | 71 | |
| € | 8 | 10.1% |
Final Punctuation
| Value | Count | Frequency (%) |
| » | 31 | |
| ’ | 6 | 16.2% |
Modifier Symbol
| Value | Count | Frequency (%) |
| ´ | 7 | |
| ` | 2 | 22.2% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 404219 | 100.0% |
Initial Punctuation
| Value | Count | Frequency (%) |
| « | 33 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3382248 | 85.2% |
| Common | 577962 | 14.6% |
| Cyrillic | 7681 | 0.2% |
| Thai | 161 | < 0.1% |
| Han | 70 | < 0.1% |
| Greek | 49 | < 0.1% |
| Arabic | 27 | < 0.1% |
| Hebrew | 19 | < 0.1% |
| Hangul | 14 | < 0.1% |
| Katakana | 8 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 330918 | 9.8% |
| a | 286274 | 8.5% |
| r | 259350 | 7.7% |
| o | 256665 | 7.6% |
| n | 220687 | 6.5% |
| i | 199614 | 5.9% |
| s | 175776 | 5.2% |
| t | 152876 | 4.5% |
| l | 146620 | 4.3% |
| c | 120363 | 3.6% |
| Other values (107) | 1233105 |
Cyrillic
| Value | Count | Frequency (%) |
| о | 672 | 8.7% |
| а | 659 | 8.6% |
| е | 597 | 7.8% |
| р | 501 | 6.5% |
| н | 480 | 6.2% |
| и | 443 | 5.8% |
| к | 398 | 5.2% |
| с | 358 | 4.7% |
| л | 268 | 3.5% |
| т | 261 | 3.4% |
| Other values (52) | 3044 |
Han
| Value | Count | Frequency (%) |
| 山 | 2 | 2.9% |
| 可 | 2 | 2.9% |
| 田 | 2 | 2.9% |
| 生 | 2 | 2.9% |
| 日 | 2 | 2.9% |
| 口 | 2 | 2.9% |
| 鎌 | 2 | 2.9% |
| 今 | 2 | 2.9% |
| 郎 | 2 | 2.9% |
| 麥 | 2 | 2.9% |
| Other values (50) | 50 |
Common
| Value | Count | Frequency (%) |
| (space) | 404219 | 69.9% |
| . | 54488 | 9.4% |
| , | 51423 | 8.9% |
| ' | 27090 | 4.7% |
| / | 10402 | 1.8% |
| - | 9789 | 1.7% |
| & | 7961 | 1.4% |
| : | 2540 | 0.4% |
| 3 | 1212 | 0.2% |
| 5 | 1130 | 0.2% |
| Other values (39) | 7708 | 1.3% |
Thai
| Value | Count | Frequency (%) |
| า | 22 | |
| ม | 21 | |
| ่ | 12 | 7.5% |
| ต | 11 | 6.8% |
| แ | 11 | 6.8% |
| ร | 11 | 6.8% |
| น | 6 | 3.7% |
| ้ | 6 | 3.7% |
| ฟ | 5 | 3.1% |
| อ | 5 | 3.1% |
| Other values (24) | 51 |
Greek
| Value | Count | Frequency (%) |
| ν | 8 | |
| ω | 4 | 8.2% |
| η | 4 | 8.2% |
| Κ | 2 | 4.1% |
| τ | 2 | 4.1% |
| Ι | 2 | 4.1% |
| ί | 2 | 4.1% |
| Ν | 2 | 4.1% |
| Ο | 2 | 4.1% |
| Υ | 2 | 4.1% |
| Other values (18) | 19 |
Hebrew
| Value | Count | Frequency (%) |
| ק | 3 | |
| ו | 3 | |
| ה | 2 | |
| פ | 2 | |
| מ | 1 | 5.3% |
| ס | 1 | 5.3% |
| י | 1 | 5.3% |
| צ | 1 | 5.3% |
| ת | 1 | 5.3% |
| ל | 1 | 5.3% |
| Other values (3) | 3 |
Hangul
| Value | Count | Frequency (%) |
| 농 | 2 | |
| 심 | 2 | |
| 빙 | 1 | |
| 그 | 1 | |
| 레 | 1 | |
| 연 | 1 | |
| 은 | 1 | |
| 자 | 1 | |
| 학 | 1 | |
| 송 | 1 | |
| Other values (2) | 2 |
Arabic
| Value | Count | Frequency (%) |
| ا | 7 | |
| ك | 6 | |
| و | 5 | |
| ل | 3 | |
| ن | 2 | 7.4% |
| د | 1 | 3.7% |
| ي | 1 | 3.7% |
| ف | 1 | 3.7% |
| ص | 1 | 3.7% |
Katakana
| Value | Count | Frequency (%) |
| ア | 1 | |
| リ | 1 | |
| グ | 1 | |
| ン | 1 | |
| ミ | 1 | |
| ナ | 1 | |
| ロ | 1 | |
| オ | 1 |
Hiragana
| Value | Count | Frequency (%) |
| い | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3945763 | 99.4% |
| None | 14466 | 0.4% |
| Cyrillic | 7681 | 0.2% |
| Thai | 161 | < 0.1% |
| CJK | 70 | < 0.1% |
| Arabic | 28 | < 0.1% |
| Hebrew | 19 | < 0.1% |
| Punctuation | 17 | < 0.1% |
| Hangul | 14 | < 0.1% |
| Currency Symbols | 8 | < 0.1% |
| Other values (3) | 13 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 404219 | 10.2% |
| e | 330918 | 8.4% |
| a | 286274 | 7.3% |
| r | 259350 | 6.6% |
| o | 256665 | 6.5% |
| n | 220687 | 5.6% |
| i | 199614 | 5.1% |
| s | 175776 | 4.5% |
| t | 152876 | 3.9% |
| l | 146620 | 3.7% |
| Other values (77) | 1512764 |
None
| Value | Count | Frequency (%) |
| é | 8152 | |
| è | 2782 | 19.2% |
| ü | 545 | 3.8% |
| ó | 439 | 3.0% |
| ô | 325 | 2.2% |
| í | 240 | 1.7% |
| â | 224 | 1.5% |
| ä | 212 | 1.5% |
| ê | 188 | 1.3% |
| î | 157 | 1.1% |
| Other values (90) | 1202 | 8.3% |
Cyrillic
| Value | Count | Frequency (%) |
| о | 672 | 8.7% |
| а | 659 | 8.6% |
| е | 597 | 7.8% |
| р | 501 | 6.5% |
| н | 480 | 6.2% |
| и | 443 | 5.8% |
| к | 398 | 5.2% |
| с | 358 | 4.7% |
| л | 268 | 3.5% |
| т | 261 | 3.4% |
| Other values (52) | 3044 |
Thai
| Value | Count | Frequency (%) |
| า | 22 | |
| ม | 21 | |
| ่ | 12 | 7.5% |
| ต | 11 | 6.8% |
| แ | 11 | 6.8% |
| ร | 11 | 6.8% |
| น | 6 | 3.7% |
| ้ | 6 | 3.7% |
| ฟ | 5 | 3.1% |
| อ | 5 | 3.1% |
| Other values (24) | 51 |
Currency Symbols
| Value | Count | Frequency (%) |
| € | 8 |
Arabic
| Value | Count | Frequency (%) |
| ا | 7 | |
| ك | 6 | |
| و | 5 | |
| ل | 3 | |
| ن | 2 | 7.1% |
| د | 1 | 3.6% |
| ، | 1 | 3.6% |
| ي | 1 | 3.6% |
| ف | 1 | 3.6% |
| ص | 1 | 3.6% |
Punctuation
| Value | Count | Frequency (%) |
| ’ | 6 | |
| — | 6 | |
| … | 4 | |
| • | 1 | 5.9% |
Letterlike Symbols
| Value | Count | Frequency (%) |
| № | 4 |
Hebrew
| Value | Count | Frequency (%) |
| ק | 3 | |
| ו | 3 | |
| ה | 2 | |
| פ | 2 | |
| מ | 1 | 5.3% |
| ס | 1 | 5.3% |
| י | 1 | 5.3% |
| צ | 1 | 5.3% |
| ת | 1 | 5.3% |
| ל | 1 | 5.3% |
| Other values (3) | 3 |
CJK
| Value | Count | Frequency (%) |
| 山 | 2 | 2.9% |
| 可 | 2 | 2.9% |
| 田 | 2 | 2.9% |
| 生 | 2 | 2.9% |
| 日 | 2 | 2.9% |
| 口 | 2 | 2.9% |
| 鎌 | 2 | 2.9% |
| 今 | 2 | 2.9% |
| 郎 | 2 | 2.9% |
| 麥 | 2 | 2.9% |
| Other values (50) | 50 |
Hangul
| Value | Count | Frequency (%) |
| 농 | 2 | |
| 심 | 2 | |
| 빙 | 1 | |
| 그 | 1 | |
| 레 | 1 | |
| 연 | 1 | |
| 은 | 1 | |
| 자 | 1 | |
| 학 | 1 | |
| 송 | 1 | |
| Other values (2) | 2 |
Hiragana
| Value | Count | Frequency (%) |
| い | 1 |
Katakana
| Value | Count | Frequency (%) |
| ア | 1 | |
| リ | 1 | |
| グ | 1 | |
| ン | 1 | |
| ミ | 1 | |
| ナ | 1 | |
| ロ | 1 | |
| オ | 1 |
additives_n
Real number (ℝ)
ZEROS 
| Distinct | 32 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.7051856 |

| Minimum | -1 |
|---|---|
| Maximum | 31 |
| Zeros | 82104 |
| Zeros (%) | 32.2% |
| Negative | 25244 |
| Negative (%) | 9.9% |
| Memory size | 3.9 MiB |
Quantile statistics
| Minimum | -1 |
|---|---|
| 5-th percentile | -1 |
| Q1 | 0 |
| median | 1 |
| Q3 | 3 |
| 95-th percentile | 7 |
| Maximum | 31 |
| Range | 32 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 2.5527784 |
|---|---|
| Coefficient of variation (CV) | 1.4970677 |
| Kurtosis | 6.8785431 |
| Mean | 1.7051856 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 2.0516666 |
| Sum | 434778 |
| Variance | 6.5166775 |
| Monotonicity | Not monotonic |
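
Several of the figures above are derived from others in the same tables, so they can be cross-checked. The following is a minimal sketch, not part of the profiling output: it recomputes range, IQR, CV, variance and sum for additives_n from the printed mean, standard deviation, quartiles and extremes, together with the number of observations given in the dataset overview. The dictionary names are illustrative only, and the tolerance is an assumption chosen to absorb rounding in the printed inputs.

```python
import math

# Figures copied from the additives_n tables in this report.
N_OBSERVATIONS = 254974  # "Number of observations" in the dataset overview
reported = {
    "mean": 1.7051856,
    "std": 2.5527784,
    "min": -1,
    "max": 31,
    "q1": 0,
    "q3": 3,
}

# Derived statistics, recomputed from the figures above.
derived = {
    "range": reported["max"] - reported["min"],
    "iqr": reported["q3"] - reported["q1"],
    "cv": reported["std"] / reported["mean"],
    "variance": reported["std"] ** 2,
    "sum": reported["mean"] * N_OBSERVATIONS,
}

# Values the report prints for the same statistics.
printed = {
    "range": 32,
    "iqr": 3,
    "cv": 1.4970677,
    "variance": 6.5166775,
    "sum": 434778,
}

for name, value in derived.items():
    # rel_tol absorbs the rounding of the printed inputs (assumed tolerance).
    ok = math.isclose(value, printed[name], rel_tol=1e-5)
    status = "ok" if ok else "differs"
    print(f"{name:>8}: recomputed {value:.7g}, printed {printed[name]} -> {status}")
```

The same check applies to the other numeric variables by swapping in their printed figures.
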
| Value | Count | Frequency (%) |
| 0 | 82104 | 32.2% |
| 1 | 44028 | 17.3% |
| 2 | 35091 | 13.8% |
| -1 | 25244 | 9.9% |
| 3 | 22854 | 9.0% |
| 4 | 14640 | 5.7% |
| 5 | 10392 | 4.1% |
| 6 | 6876 | 2.7% |
| 7 | 4395 | 1.7% |
| 8 | 3171 | 1.2% |
| Other values (22) | 6179 | 2.4% |
| Value | Count | Frequency (%) |
| -1 | 25244 | 9.9% |
| 0 | 82104 | 32.2% |
| 1 | 44028 | 17.3% |
| 2 | 35091 | 13.8% |
| 3 | 22854 | 9.0% |
| 4 | 14640 | 5.7% |
| 5 | 10392 | 4.1% |
| 6 | 6876 | 2.7% |
| 7 | 4395 | 1.7% |
| 8 | 3171 | 1.2% |
| Value | Count | Frequency (%) |
| 31 | 4 | < 0.1% |
| 29 | 2 | < 0.1% |
| 28 | 2 | < 0.1% |
| 27 | 2 | < 0.1% |
| 26 | 2 | < 0.1% |
| 25 | 11 | < 0.1% |
| 24 | 10 | < 0.1% |
| 23 | 14 | < 0.1% |
| 22 | 26 | < 0.1% |
| 21 | 20 | < 0.1% |
energy_100g
Real number (ℝ)
ZEROS 
| Distinct | 3817 |
|---|---|
| Distinct (%) | 1.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1131.0096 |

| Minimum | 0 |
|---|---|
| Maximum | 3700 |
| Zeros | 6091 |
| Zeros (%) | 2.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.9 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 84 |
| Q1 | 393 |
| median | 1117 |
| Q3 | 1674 |
| 95-th percentile | 2389 |
| Maximum | 3700 |
| Range | 3700 |
| Interquartile range (IQR) | 1281 |
Descriptive statistics
| Standard deviation | 782.9411 |
|---|---|
| Coefficient of variation (CV) | 0.69224972 |
| Kurtosis | -0.50182767 |
| Mean | 1131.0096 |
| Median Absolute Deviation (MAD) | 657 |
| Skewness | 0.40962332 |
| Sum | 2.8837805 × 10⁸ |
| Variance | 612996.77 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 6091 | 2.4% |
| 2092 | 5074 | 2.0% |
| 1674 | 4010 | 1.6% |
| 1494 | 3912 | 1.5% |
| 1644 | 3276 | 1.3% |
| 1393 | 3219 | 1.3% |
| 1046 | 2943 | 1.2% |
| 1569 | 2824 | 1.1% |
| 1795 | 2347 | 0.9% |
| 1197 | 2312 | 0.9% |
| Other values (3807) | 218966 | 85.9% |
| Value | Count | Frequency (%) |
| 0 | 6091 | 2.4% |
| 0.02 | 1 | < 0.1% |
| 0.42 | 1 | < 0.1% |
| 0.48 | 1 | < 0.1% |
| 0.6 | 1 | < 0.1% |
| 0.8 | 7 | < 0.1% |
| 0.9 | 4 | < 0.1% |
| 0.92 | 4 | < 0.1% |
| 1 | 49 | < 0.1% |
| 1.1 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 3700 | 256 | 0.1% |
| 3699 | 5 | < 0.1% |
| 3697 | 1 | < 0.1% |
| 3696 | 3 | < 0.1% |
| 3693 | 6 | < 0.1% |
| 3692 | 1 | < 0.1% |
| 3691 | 1 | < 0.1% |
| 3690 | 3 | < 0.1% |
| 3689 | 2 | < 0.1% |
| 3686 | 1 | < 0.1% |
salt_100g
Real number (ℝ)
ZEROS 
| Distinct | 5500 |
|---|---|
| Distinct (%) | 2.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.5783321 |

| Minimum | 0 |
|---|---|
| Maximum | 100 |
| Zeros | 36339 |
| Zeros (%) | 14.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.9 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0.05334 |
| median | 0.56388 |
| Q3 | 1.36144 |
| 95-th percentile | 4 |
| Maximum | 100 |
| Range | 100 |
| Interquartile range (IQR) | 1.3081 |
Descriptive statistics
| Standard deviation | 6.2312293 |
|---|---|
| Coefficient of variation (CV) | 3.9479835 |
| Kurtosis | 141.59793 |
| Mean | 1.5783321 |
| Median Absolute Deviation (MAD) | 0.54388 |
| Skewness | 11.077406 |
| Sum | 402433.66 |
| Variance | 38.828219 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 36339 | 14.3% |
| 0.01 | 3564 | 1.4% |
| 0.1 | 3305 | 1.3% |
| 1 | 2153 | 0.8% |
| 0.0254 | 2091 | 0.8% |
| 1.27 | 1938 | 0.8% |
| 1.63322 | 1824 | 0.7% |
| 0.127 | 1775 | 0.7% |
| 0.03 | 1558 | 0.6% |
| 0.02032 | 1537 | 0.6% |
| Other values (5490) | 198890 | 78.0% |
| Value | Count | Frequency (%) |
| 0 | 36339 | 14.3% |
| 5 × 10⁻⁸ | 1 | < 0.1% |
| 9.999999 × 10⁻⁸ | 2 | < 0.1% |
| 1 × 10⁻⁶ | 1 | < 0.1% |
| 5 × 10⁻⁶ | 1 | < 0.1% |
| 7.874 × 10⁻⁶ | 1 | < 0.1% |
| 1 × 10⁻⁵ | 5 | < 0.1% |
| 1.3 × 10⁻⁵ | 4 | < 0.1% |
| 2 × 10⁻⁵ | 1 | < 0.1% |
| 2.413 × 10⁻⁵ | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 100 | 21 | < 0.1% |
| 99.93 | 1 | < 0.1% |
| 99.90582 | 111 | < 0.1% |
| 99.9 | 8 | < 0.1% |
| 99.822 | 5 | < 0.1% |
| 99.8 | 3 | < 0.1% |
| 99.78644 | 10 | < 0.1% |
| 99.64674 | 3 | < 0.1% |
| 99.568 | 1 | < 0.1% |
| 99.5 | 1 | < 0.1% |
sodium_100g
Real number (ℝ)
ZEROS 
| Distinct | 5251 |
|---|---|
| Distinct (%) | 2.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.64268795 |

| Minimum | 0 |
|---|---|
| Maximum | 100 |
| Zeros | 36343 |
| Zeros (%) | 14.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.9 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0.021 |
| median | 0.222 |
| Q3 | 0.536 |
| 95-th percentile | 1.583 |
| Maximum | 100 |
| Range | 100 |
| Interquartile range (IQR) | 0.515 |
Descriptive statistics
| Standard deviation | 2.6506021 |
|---|---|
| Coefficient of variation (CV) | 4.1242443 |
| Kurtosis | 162.08847 |
| Mean | 0.64268795 |
| Median Absolute Deviation (MAD) | 0.21412598 |
| Skewness | 11.525525 |
| Sum | 163868.72 |
| Variance | 7.0256914 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 36343 | 14.3% |
| 0.003937007874 | 3559 | 1.4% |
| 0.03937007874 | 3290 | 1.3% |
| 0.3937007874 | 2138 | 0.8% |
| 0.01 | 2090 | 0.8% |
| 0.5 | 1936 | 0.8% |
| 0.643 | 1847 | 0.7% |
| 0.01181102362 | 1842 | 0.7% |
| 0.05 | 1775 | 0.7% |
| 0.008 | 1541 | 0.6% |
| Other values (5241) | 198613 | 77.9% |
| Value | Count | Frequency (%) |
| 0 | 36343 | 14.3% |
| 1.968503937 × 10⁻⁸ | 1 | < 0.1% |
| 3.93700748 × 10⁻⁸ | 2 | < 0.1% |
| 3.937007874 × 10⁻⁷ | 1 | < 0.1% |
| 1.968503937 × 10⁻⁶ | 1 | < 0.1% |
| 3.1 × 10⁻⁶ | 1 | < 0.1% |
| 3.937007874 × 10⁻⁶ | 5 | < 0.1% |
| 5.118110236 × 10⁻⁶ | 4 | < 0.1% |
| 7.874015748 × 10⁻⁶ | 1 | < 0.1% |
| 9.5 × 10⁻⁶ | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 100 | 1 | < 0.1% |
| 92.5 | 1 | < 0.1% |
| 83 | 1 | < 0.1% |
| 75 | 2 | < 0.1% |
| 74 | 1 | < 0.1% |
| 71.429 | 1 | < 0.1% |
| 70 | 1 | < 0.1% |
| 62.5 | 1 | < 0.1% |
| 60.3 | 1 | < 0.1% |
| 59 | 1 | < 0.1% |
fiber_100g
Real number (ℝ)
ZEROS 
| Distinct | 1010 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.2069436 |

| Minimum | 0 |
|---|---|
| Maximum | 100 |
| Zeros | 124820 |
| Zeros (%) | 49.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.9 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0.3 |
| Q3 | 3 |
| 95-th percentile | 9.6 |
| Maximum | 100 |
| Range | 100 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 4.219424 |
|---|---|
| Coefficient of variation (CV) | 1.9118857 |
| Kurtosis | 58.132747 |
| Mean | 2.2069436 |
| Median Absolute Deviation (MAD) | 0.3 |
| Skewness | 5.4282559 |
| Sum | 562713.25 |
| Variance | 17.803539 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 124820 | 49.0% |
| 3.6 | 8511 | 3.3% |
| 3.3 | 3986 | 1.6% |
| 1.8 | 3874 | 1.5% |
| 0.8 | 3799 | 1.5% |
| 7.1 | 3703 | 1.5% |
| 1.6 | 3413 | 1.3% |
| 2 | 3399 | 1.3% |
| 1.2 | 3259 | 1.3% |
| 2.4 | 3221 | 1.3% |
| Other values (1000) | 92989 | 36.5% |
| Value | Count | Frequency (%) |
| 0 | 124820 | 49.0% |
| 0.0001 | 2 | < 0.1% |
| 0.0002 | 1 | < 0.1% |
| 0.001 | 16 | < 0.1% |
| 0.002 | 3 | < 0.1% |
| 0.004 | 1 | < 0.1% |
| 0.00416 | 1 | < 0.1% |
| 0.005 | 2 | < 0.1% |
| 0.01 | 72 | < 0.1% |
| 0.016 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 100 | 10 | < 0.1% |
| 99 | 1 | < 0.1% |
| 94.8 | 1 | < 0.1% |
| 92.4 | 1 | < 0.1% |
| 90 | 1 | < 0.1% |
| 88 | 2 | < 0.1% |
| 87.5 | 1 | < 0.1% |
| 87 | 1 | < 0.1% |
| 86.2 | 1 | < 0.1% |
| 85.2 | 1 | < 0.1% |
sugars_100g
Real number (ℝ)
ZEROS 
| Distinct | 4052 |
|---|---|
| Distinct (%) | 1.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 15.10314 |

| Minimum | 0 |
|---|---|
| Maximum | 100 |
| Zeros | 50158 |
| Zeros (%) | 19.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.9 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0.8 |
| median | 4.8 |
| Q3 | 22.22 |
| 95-th percentile | 60.774 |
| Maximum | 100 |
| Range | 100 |
| Interquartile range (IQR) | 21.42 |
Descriptive statistics
| Standard deviation | 20.84872 |
|---|---|
| Coefficient of variation (CV) | 1.3804228 |
| Kurtosis | 2.4900988 |
| Mean | 15.10314 |
| Median Absolute Deviation (MAD) | 4.8 |
| Skewness | 1.7375937 |
| Sum | 3850908 |
| Variance | 434.66911 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 50158 | 19.7% |
| 3.57 | 7142 | 2.8% |
| 0.5 | 4359 | 1.7% |
| 3.33 | 3700 | 1.5% |
| 1 | 2567 | 1.0% |
| 20 | 2323 | 0.9% |
| 6.67 | 2264 | 0.9% |
| 10 | 2168 | 0.9% |
| 50 | 2101 | 0.8% |
| 7.14 | 2024 | 0.8% |
| Other values (4042) | 176168 | 69.1% |
| Value | Count | Frequency (%) |
| 0 | 50158 | 19.7% |
| 0.0001 | 8 | < 0.1% |
| 0.0005 | 1 | < 0.1% |
| 0.001 | 24 | < 0.1% |
| 0.0019 | 2 | < 0.1% |
| 0.0048 | 1 | < 0.1% |
| 0.007 | 1 | < 0.1% |
| 0.01 | 85 | < 0.1% |
| 0.0108 | 1 | < 0.1% |
| 0.02 | 27 | < 0.1% |
| Value | Count | Frequency (%) |
| 100 | 921 | 0.4% |
| 99.95 | 1 | < 0.1% |
| 99.9 | 6 | < 0.1% |
| 99.8 | 3 | < 0.1% |
| 99.7 | 10 | < 0.1% |
| 99.6 | 4 | < 0.1% |
| 99.5 | 10 | < 0.1% |
| 99.3 | 2 | < 0.1% |
| 99.2 | 3 | < 0.1% |
| 99 | 40 | < 0.1% |
fat_100g
Real number (ℝ)
ZEROS 
| Distinct | 3370 |
|---|---|
| Distinct (%) | 1.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 11.979729 |

| Minimum | 0 |
|---|---|
| Maximum | 100 |
| Zeros | 78545 |
| Zeros (%) | 30.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.9 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 3.79 |
| Q3 | 19 |
| 95-th percentile | 45 |
| Maximum | 100 |
| Range | 100 |
| Interquartile range (IQR) | 19 |
Descriptive statistics
| Standard deviation | 17.247552 |
|---|---|
| Coefficient of variation (CV) | 1.439728 |
| Kurtosis | 6.5670472 |
| Mean | 11.979729 |
| Median Absolute Deviation (MAD) | 3.79 |
| Skewness | 2.2702762 |
| Sum | 3054519.5 |
| Variance | 297.47804 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 78545 | 30.8% |
| 25 | 3375 | 1.3% |
| 32.14 | 2978 | 1.2% |
| 0.5 | 2967 | 1.2% |
| 20 | 2656 | 1.0% |
| 1.79 | 2526 | 1.0% |
| 28.57 | 2459 | 1.0% |
| 21.43 | 2411 | 0.9% |
| 0.1 | 2367 | 0.9% |
| 10 | 2238 | 0.9% |
| Other values (3360) | 152452 | 59.8% |
| Value | Count | Frequency (%) |
| 0 | 78545 | 30.8% |
| 0.0001 | 2 | < 0.1% |
| 0.000133 | 1 | < 0.1% |
| 0.001 | 1 | < 0.1% |
| 0.003 | 1 | < 0.1% |
| 0.004 | 2 | < 0.1% |
| 0.005 | 3 | < 0.1% |
| 0.007 | 1 | < 0.1% |
| 0.01 | 42 | < 0.1% |
| 0.012 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 100 | 1280 | 0.5% |
| 99.9 | 16 | < 0.1% |
| 99.85 | 1 | < 0.1% |
| 99.82 | 1 | < 0.1% |
| 99.8 | 17 | < 0.1% |
| 99.7 | 5 | < 0.1% |
| 99.4 | 5 | < 0.1% |
| 99 | 5 | < 0.1% |
| 98.73 | 1 | < 0.1% |
| 98.5 | 1 | < 0.1% |
saturated_fat_100g
Real number (ℝ)
ZEROS 
| Distinct | 2192 |
|---|---|
| Distinct (%) | 0.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.5437063 |

| Minimum | 0 |
|---|---|
| Maximum | 100 |
| Zeros | 96736 |
| Zeros (%) | 37.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.9 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 6.58 |
| 95-th percentile | 19 |
| Maximum | 100 |
| Range | 100 |
| Interquartile range (IQR) | 6.58 |
Descriptive statistics
| Standard deviation | 7.6251009 |
|---|---|
| Coefficient of variation (CV) | 1.6781677 |
| Kurtosis | 23.985088 |
| Mean | 4.5437063 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 3.6365845 |
| Sum | 1158527 |
| Variance | 58.142164 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 96736 | 37.9% |
| 0.1 | 5227 | 2.1% |
| 3.57 | 3480 | 1.4% |
| 0.5 | 3079 | 1.2% |
| 7.14 | 2875 | 1.1% |
| 0.2 | 2558 | 1.0% |
| 1 | 2335 | 0.9% |
| 0.3 | 2302 | 0.9% |
| 3.33 | 2211 | 0.9% |
| 1.79 | 2189 | 0.9% |
| Other values (2182) | 131982 | 51.8% |
| Value | Count | Frequency (%) |
| 0 | 96736 | 37.9% |
| 0.0001 | 11 | < 0.1% |
| 0.001 | 30 | < 0.1% |
| 0.002 | 10 | < 0.1% |
| 0.003 | 4 | < 0.1% |
| 0.0032 | 1 | < 0.1% |
| 0.004 | 3 | < 0.1% |
| 0.005 | 11 | < 0.1% |
| 0.006 | 2 | < 0.1% |
| 0.00667 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 100 | 12 | < 0.1% |
| 99.9 | 1 | < 0.1% |
| 99 | 2 | < 0.1% |
| 98 | 1 | < 0.1% |
| 96 | 2 | < 0.1% |
| 95.5 | 1 | < 0.1% |
| 95 | 5 | < 0.1% |
| 94 | 2 | < 0.1% |
| 93.8 | 1 | < 0.1% |
| 93.33 | 3 | < 0.1% |
cholesterol_100g
Real number (ℝ)
SKEWED  ZEROS 
| Distinct | 535 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.011335686 |

| Minimum | 0 |
|---|---|
| Maximum | 95.238 |
| Zeros | 200361 |
| Zeros (%) | 78.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.9 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0.071 |
| Maximum | 95.238 |
| Range | 95.238 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.26935215 |
|---|---|
| Coefficient of variation (CV) | 23.761433 |
| Kurtosis | 91156.825 |
| Mean | 0.011335686 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 293.63875 |
| Sum | 2890.3053 |
| Variance | 0.07255058 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 200361 | 78.6% |
| 0.071 | 2461 | 1.0% |
| 0.107 | 2235 | 0.9% |
| 0.012 | 1908 | 0.7% |
| 0.089 | 1662 | 0.7% |
| 0.054 | 1651 | 0.6% |
| 0.018 | 1587 | 0.6% |
| 0.004 | 1503 | 0.6% |
| 0.036 | 1385 | 0.5% |
| 0.008 | 1209 | 0.5% |
| Other values (525) | 39012 | 15.3% |
| Value | Count | Frequency (%) |
| 0 | 200361 | 78.6% |
| 4.5 × 10⁻⁵ | 1 | < 0.1% |
| 7.1 × 10⁻⁵ | 1 | < 0.1% |
| 0.0001 | 5 | < 0.1% |
| 0.0002 | 5 | < 0.1% |
| 0.0004 | 1 | < 0.1% |
| 0.000416 | 1 | < 0.1% |
| 0.00046 | 1 | < 0.1% |
| 0.0005 | 2 | < 0.1% |
| 0.0008 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 95.238 | 1 | < 0.1% |
| 70.588 | 1 | < 0.1% |
| 62.5 | 1 | < 0.1% |
| 13.846 | 1 | < 0.1% |
| 10.9 | 1 | < 0.1% |
| 1.58 | 1 | < 0.1% |
| 1.291 | 1 | < 0.1% |
| 1.25 | 1 | < 0.1% |
| 1.081 | 1 | < 0.1% |
| 0.996 | 1 | < 0.1% |
nutrition_grade_fr
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.2 MiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 254974 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | d |
|---|---|
| 2nd row | b |
| 3rd row | d |
| 4th row | a |
| 5th row | a |
Common Values
| Value | Count | Frequency (%) |
| a | 72728 | 28.5% |
| d | 62019 | 24.3% |
| c | 44924 | 17.6% |
| e | 42377 | 16.6% |
| b | 32926 | 12.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 72728 | 28.5% |
| d | 62019 | 24.3% |
| c | 44924 | 17.6% |
| e | 42377 | 16.6% |
| b | 32926 | 12.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 254974 | 100.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 72728 | 28.5% |
| d | 62019 | 24.3% |
| c | 44924 | 17.6% |
| e | 42377 | 16.6% |
| b | 32926 | 12.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 254974 | 100.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 72728 | 28.5% |
| d | 62019 | 24.3% |
| c | 44924 | 17.6% |
| e | 42377 | 16.6% |
| b | 32926 | 12.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 254974 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 72728 | 28.5% |
| d | 62019 | 24.3% |
| c | 44924 | 17.6% |
| e | 42377 | 16.6% |
| b | 32926 | 12.9% |
nutrition_score_uk_100g
Real number (ℝ)
ZEROS 
| Distinct | 22209 |
|---|---|
| Distinct (%) | 8.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 9.0157221 |

| Minimum | -15 |
|---|---|
| Maximum | 40 |
| Zeros | 12521 |
| Zeros (%) | 4.9% |
| Negative | 37141 |
| Negative (%) | 14.6% |
| Memory size | 3.9 MiB |
Quantile statistics
| Minimum | -15 |
|---|---|
| 5-th percentile | -4 |
| Q1 | 2 |
| median | 9 |
| Q3 | 16 |
| 95-th percentile | 24 |
| Maximum | 40 |
| Range | 55 |
| Interquartile range (IQR) | 14 |
Descriptive statistics
| Standard deviation | 8.7620999 |
|---|---|
| Coefficient of variation (CV) | 0.97186889 |
| Kurtosis | -0.86589005 |
| Mean | 9.0157221 |
| Median Absolute Deviation (MAD) | 7 |
| Skewness | 0.19504308 |
| Sum | 2298774.7 |
| Variance | 76.774394 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 12521 | 4.9% |
| 1 | 11784 | 4.6% |
| 2 | 10932 | 4.3% |
| 14 | 10565 | 4.1% |
| -1 | 8730 | 3.4% |
| 13 | 8309 | 3.3% |
| 12 | 8140 | 3.2% |
| 11 | 7980 | 3.1% |
| 3 | 7489 | 2.9% |
| 20 | 7301 | 2.9% |
| Other values (22199) | 161223 | 63.2% |
| Value | Count | Frequency (%) |
| -15 | 12 | < 0.1% |
| -14 | 5 | < 0.1% |
| -13 | 23 | < 0.1% |
| -12.98946843 | 1 | < 0.1% |
| -12 | 46 | < 0.1% |
| -11.89017976 | 1 | < 0.1% |
| -11.71202015 | 1 | < 0.1% |
| -11.11047737 | 1 | < 0.1% |
| -11.02351469 | 1 | < 0.1% |
| -11 | 90 | < 0.1% |
| Value | Count | Frequency (%) |
| 40 | 33 | < 0.1% |
| 39.62928274 | 27 | < 0.1% |
| 39.57922926 | 14 | < 0.1% |
| 38.90178138 | 7 | < 0.1% |
| 38.74275539 | 1 | < 0.1% |
| 38 | 1 | < 0.1% |
| 37.86717433 | 3 | < 0.1% |
| 37.22136799 | 1 | < 0.1% |
| 37.13967297 | 9 | < 0.1% |
| 37 | 2 | < 0.1% |
nutrition_score_fr_100g
Real number (ℝ)
ZEROS 
| Distinct | 22224 |
|---|---|
| Distinct (%) | 8.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 9.1377468 |

| Minimum | -15 |
|---|---|
| Maximum | 40 |
| Zeros | 11704 |
| Zeros (%) | 4.6% |
| Negative | 35481 |
| Negative (%) | 13.9% |
| Memory size | 3.9 MiB |
Quantile statistics
| Minimum | -15 |
|---|---|
| 5-th percentile | -4 |
| Q1 | 2 |
| median | 9 |
| Q3 | 15.613139 |
| 95-th percentile | 24 |
| Maximum | 40 |
| Range | 55 |
| Interquartile range (IQR) | 13.613139 |
Descriptive statistics
| Standard deviation | 8.6223733 |
|---|---|
| Coefficient of variation (CV) | 0.9435995 |
| Kurtosis | -0.81166515 |
| Mean | 9.1377468 |
| Median Absolute Deviation (MAD) | 7 |
| Skewness | 0.16934547 |
| Sum | 2329887.9 |
| Variance | 74.345322 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 11704 | 4.6% |
| 1 | 11130 | 4.4% |
| 14 | 11124 | 4.4% |
| 2 | 10457 | 4.1% |
| 13 | 8725 | 3.4% |
| -1 | 8707 | 3.4% |
| 12 | 8558 | 3.4% |
| 11 | 8537 | 3.3% |
| 3 | 7725 | 3.0% |
| 15 | 7457 | 2.9% |
| Other values (22214) | 160850 | 63.1% |
| Value | Count | Frequency (%) |
| -15 | 1 | < 0.1% |
| -14.56626981 | 1 | < 0.1% |
| -14.53003013 | 1 | < 0.1% |
| -14.43958461 | 1 | < 0.1% |
| -14.42682726 | 1 | < 0.1% |
| -14.42473158 | 2 | < 0.1% |
| -14.38149172 | 2 | < 0.1% |
| -14.34924791 | 3 | < 0.1% |
| -14 | 5 | < 0.1% |
| -13 | 23 | < 0.1% |
| Value | Count | Frequency (%) |
| 40 | 4 | < 0.1% |
| 38.5578851 | 1 | < 0.1% |
| 38.51842973 | 1 | < 0.1% |
| 38.49152462 | 3 | < 0.1% |
| 38.4492283 | 1 | < 0.1% |
| 38.44860451 | 1 | < 0.1% |
| 38.44609177 | 20 | < 0.1% |
| 38.43481639 | 2 | < 0.1% |
| 38.41384923 | 1 | < 0.1% |
| 38.09412942 | 27 | < 0.1% |
| | code | countries_fr | product_name | brands | additives_n | energy_100g | salt_100g | sodium_100g | fiber_100g | sugars_100g | fat_100g | saturated_fat_100g | cholesterol_100g | nutrition_grade_fr | nutrition_score_uk_100g | nutrition_score_fr_100g |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0000000004530 | États-Unis | Banana Chips Sweetened (Whole) | NaN | 0.0 | 2243.0 | 0.00000 | 0.000 | 3.6 | 14.29 | 28.57 | 28.57 | 0.018 | d | 14.000000 | 14.000000 |
| 1 | 0000000004559 | États-Unis | Peanuts | Torn & Glasser | 0.0 | 1941.0 | 0.63500 | 0.250 | 7.1 | 17.86 | 17.86 | 0.00 | 0.000 | b | 0.000000 | 0.000000 |
| 2 | 0000000016087 | États-Unis | Organic Salted Nut Mix | Grizzlies | 0.0 | 2540.0 | 1.22428 | 0.482 | 7.1 | 3.57 | 57.14 | 5.36 | 0.000 | d | 12.000000 | 12.000000 |
| 3 | 0000000016094 | États-Unis | Organic Polenta | Bob's Red Mill | 0.0 | 1552.0 | 0.00000 | 0.000 | 5.7 | 0.00 | 1.43 | 0.00 | 0.000 | a | 5.344733 | 5.323729 |
| 4 | 0000000016100 | États-Unis | Breadshop Honey Gone Nuts Granola | Unfi | 0.0 | 1933.0 | 0.00000 | 0.000 | 7.7 | 11.54 | 18.27 | 1.92 | 0.000 | a | 8.012207 | 7.898541 |
| 5 | 0000000016117 | États-Unis | Organic Long Grain White Rice | Lundberg | 0.0 | 1490.0 | 0.00000 | 0.000 | 0.0 | 0.00 | 0.00 | 0.00 | 0.000 | a | 6.827355 | 6.809392 |
| 6 | 0000000016124 | États-Unis | Organic Muesli | Daddy's Muesli | 2.0 | 1833.0 | 0.13970 | 0.055 | 9.4 | 15.62 | 18.75 | 4.69 | 0.000 | c | 7.000000 | 7.000000 |
| 7 | 0000000016193 | États-Unis | Organic Dark Chocolate Minis | Equal Exchange | 0.0 | 2406.0 | 0.00000 | 0.000 | 7.5 | 42.50 | 37.50 | 22.50 | 0.000 | a | 18.277754 | 18.060035 |
| 8 | 0000000016513 | États-Unis | Organic Sunflower Oil | Napa Valley Naturals | 0.0 | 3586.0 | 0.00000 | 0.000 | 0.0 | 0.00 | 100.00 | 7.14 | 0.000 | a | 18.822279 | 17.765301 |
| 9 | 0000000016612 | États-Unis | Organic Adzuki Beans | Unfi | 0.0 | 1393.0 | 0.00000 | 0.000 | 12.5 | 0.00 | 1.04 | 0.00 | 0.000 | a | 2.874713 | 2.929574 |
| | code | countries_fr | product_name | brands | additives_n | energy_100g | salt_100g | sodium_100g | fiber_100g | sugars_100g | fat_100g | saturated_fat_100g | cholesterol_100g | nutrition_grade_fr | nutrition_score_uk_100g | nutrition_score_fr_100g |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 258313 | 9780803738782 | États-Unis | Organic Z Bar | Clif Kid | 1.0 | 1393.0 | 0.95250 | 0.375000 | 8.3 | 30.56 | 9.72 | 2.78 | 0.0 | d | 11.000000 | 11.000000 |
| 258314 | 9782211109758 | France | Verrine Cheescake Myrtille | Kayser | -1.0 | 1084.0 | 0.29000 | 0.114173 | 0.0 | 10.50 | 0.00 | 12.00 | 0.0 | d | 16.000000 | 16.000000 |
| 258315 | 9782401029101 | France | Fiche Brevet | Hatier | -1.0 | 4.0 | 10.00000 | 3.937008 | 10.0 | 1.00 | 0.00 | 1.00 | 0.0 | b | 0.000000 | 0.000000 |
| 258316 | 9787461062105 | États-Unis | Natural Cassava | Industria De Casabe Paul | 0.0 | 1477.0 | 0.03048 | 0.012000 | 4.7 | 2.35 | 0.00 | 0.00 | 0.0 | a | -1.000000 | -1.000000 |
| 258317 | 9836654056565 | États-Unis | Raspados Ice Bars | Jarritos, The Jel Sert Company | 8.0 | 368.0 | 0.04572 | 0.018000 | 0.0 | 19.30 | 0.00 | 0.00 | 0.0 | a | 7.139006 | 7.568028 |
| 258318 | 9847548283004 | France | Tartines craquantes bio au sarrasin | Le Pain des fleurs | -1.0 | 1643.0 | 0.68000 | 0.267717 | 5.9 | 2.60 | 2.80 | 0.60 | 0.0 | a | -4.000000 | -4.000000 |
| 258319 | 989898 | Suisse | Test NF App | NaN | 0.0 | 569.0 | 1.10000 | 0.433071 | 1.1 | 9.60 | 31.00 | 0.00 | 0.0 | a | 6.661093 | 6.888682 |
| 258320 | 9900000000233 | France | Amandes | Biosic | -1.0 | 2406.0 | 0.10000 | 0.039370 | 12.2 | 3.89 | 0.00 | 3.73 | 0.0 | b | 0.000000 | 0.000000 |
| 258321 | 99111250 | France | Thé vert Earl grey | Lobodis | 0.0 | 21.0 | 0.02540 | 0.010000 | 0.2 | 0.50 | 0.20 | 0.20 | 0.0 | c | 0.000000 | 2.000000 |
| 258322 | 999990026839 | États-Unis | Sugar Free Drink Mix, Peach Tea | Market Pantry | 7.0 | 2092.0 | 0.00000 | 0.000000 | 0.0 | 0.00 | 0.00 | 0.00 | 0.0 | a | 9.748240 | 9.501761 |
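
In the sample rows, salt_100g appears to be a constant multiple of sodium_100g. The report does not state this relationship, so the sketch below is only an observation on the 20 rows shown: it divides salt by sodium for every sample row where both are non-zero. The `pairs` list is copied by hand from those rows, and nothing here claims the ratio holds across the full dataset.

```python
# (salt_100g, sodium_100g) pairs copied from the sample rows;
# rows where both values are 0 are skipped to avoid dividing by zero.
pairs = [
    (0.63500, 0.250),
    (1.22428, 0.482),
    (0.13970, 0.055),
    (0.95250, 0.375000),
    (0.29000, 0.114173),
    (10.00000, 3.937008),
    (0.03048, 0.012000),
    (0.04572, 0.018000),
    (0.68000, 0.267717),
    (1.10000, 0.433071),
    (0.10000, 0.039370),
    (0.02540, 0.010000),
]

# Ratio of salt to sodium for each sampled product.
ratios = [salt / sodium for salt, sodium in pairs]

for (salt, sodium), ratio in zip(pairs, ratios):
    print(f"salt={salt:<9} sodium={sodium:<9} ratio={ratio:.4f}")
```

If the ratio is stable in the full data, the two columns carry the same information and one of them could be dropped before modelling.
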